Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 5405 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.8 MiB |
| Average record size in memory | 355.6 B |
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 12 |
SERIES has constant value "EQ" | Constant |
DATE has a high cardinality: 1081 distinct values | High cardinality |
OPEN is highly correlated with HIGH and 7 other fields | High correlation |
HIGH is highly correlated with OPEN and 7 other fields | High correlation |
LOW is highly correlated with OPEN and 7 other fields | High correlation |
PREV. CLOSE is highly correlated with OPEN and 7 other fields | High correlation |
LTP is highly correlated with OPEN and 7 other fields | High correlation |
CLOSE is highly correlated with OPEN and 7 other fields | High correlation |
VWAP is highly correlated with OPEN and 7 other fields | High correlation |
52W H is highly correlated with OPEN and 8 other fields | High correlation |
52W L is highly correlated with OPEN and 8 other fields | High correlation |
VOLUME is highly correlated with 52W H and 3 other fields | High correlation |
VALUE is highly correlated with VOLUME and 1 other fields | High correlation |
NO OF TRADES is highly correlated with VOLUME and 1 other fields | High correlation |
OPEN is highly correlated with HIGH and 7 other fields | High correlation |
HIGH is highly correlated with OPEN and 7 other fields | High correlation |
LOW is highly correlated with OPEN and 7 other fields | High correlation |
PREV. CLOSE is highly correlated with OPEN and 7 other fields | High correlation |
LTP is highly correlated with OPEN and 7 other fields | High correlation |
CLOSE is highly correlated with OPEN and 7 other fields | High correlation |
VWAP is highly correlated with OPEN and 7 other fields | High correlation |
52W H is highly correlated with OPEN and 7 other fields | High correlation |
52W L is highly correlated with OPEN and 7 other fields | High correlation |
VOLUME is highly correlated with NO OF TRADES | High correlation |
VALUE is highly correlated with NO OF TRADES | High correlation |
NO OF TRADES is highly correlated with VOLUME and 1 other fields | High correlation |
OPEN is highly correlated with HIGH and 7 other fields | High correlation |
HIGH is highly correlated with OPEN and 7 other fields | High correlation |
LOW is highly correlated with OPEN and 7 other fields | High correlation |
PREV. CLOSE is highly correlated with OPEN and 7 other fields | High correlation |
LTP is highly correlated with OPEN and 7 other fields | High correlation |
CLOSE is highly correlated with OPEN and 7 other fields | High correlation |
VWAP is highly correlated with OPEN and 7 other fields | High correlation |
52W H is highly correlated with OPEN and 7 other fields | High correlation |
52W L is highly correlated with OPEN and 7 other fields | High correlation |
VALUE is highly correlated with NO OF TRADES | High correlation |
NO OF TRADES is highly correlated with VALUE | High correlation |
Source.Name is highly correlated with SYMBOL and 1 other fields | High correlation |
SYMBOL is highly correlated with Source.Name and 1 other fields | High correlation |
SERIES is highly correlated with Source.Name and 1 other fields | High correlation |
Source.Name is highly correlated with OPEN and 11 other fields | High correlation |
OPEN is highly correlated with Source.Name and 9 other fields | High correlation |
HIGH is highly correlated with Source.Name and 9 other fields | High correlation |
LOW is highly correlated with Source.Name and 9 other fields | High correlation |
PREV. CLOSE is highly correlated with Source.Name and 9 other fields | High correlation |
LTP is highly correlated with Source.Name and 9 other fields | High correlation |
CLOSE is highly correlated with Source.Name and 9 other fields | High correlation |
VWAP is highly correlated with Source.Name and 9 other fields | High correlation |
52W H is highly correlated with Source.Name and 9 other fields | High correlation |
52W L is highly correlated with Source.Name and 9 other fields | High correlation |
VOLUME is highly correlated with Source.Name and 2 other fields | High correlation |
VALUE is highly correlated with NO OF TRADES | High correlation |
NO OF TRADES is highly correlated with Source.Name and 3 other fields | High correlation |
SYMBOL is highly correlated with Source.Name and 11 other fields | High correlation |
Source.Name is uniformly distributed | Uniform |
DATE is uniformly distributed | Uniform |
SYMBOL is uniformly distributed | Uniform |
VALUE has unique values | Unique |
Reproduction
| Analysis started | 2022-07-31 17:07:30.429024 |
|---|---|
| Analysis finished | 2022-07-31 17:07:51.678789 |
| Duration | 21.25 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 373.8 KiB |
| SBIN_Data.csv | |
|---|---|
| WIPRO_Data.csv | |
| TCS_Data.csv | |
| TATA_Data.csv | |
| RELIANCE_Data.csv |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 13.8 |
| Min length | 12 |
Characters and Unicode
| Total characters | 74589 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RELIANCE_Data.csv |
|---|---|
| 2nd row | RELIANCE_Data.csv |
| 3rd row | RELIANCE_Data.csv |
| 4th row | RELIANCE_Data.csv |
| 5th row | RELIANCE_Data.csv |
Common Values
| Value | Count | Frequency (%) |
| SBIN_Data.csv | 1081 | |
| WIPRO_Data.csv | 1081 | |
| TCS_Data.csv | 1081 | |
| TATA_Data.csv | 1081 | |
| RELIANCE_Data.csv | 1081 |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| sbin_data.csv | 1081 | |
| wipro_data.csv | 1081 | |
| tcs_data.csv | 1081 | |
| tata_data.csv | 1081 | |
| reliance_data.csv | 1081 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10810 | |
| s | 5405 | 7.2% |
| _ | 5405 | 7.2% |
| D | 5405 | 7.2% |
| t | 5405 | 7.2% |
| . | 5405 | 7.2% |
| c | 5405 | 7.2% |
| v | 5405 | 7.2% |
| I | 3243 | 4.3% |
| A | 3243 | 4.3% |
| Other values (11) | 19458 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32430 | |
| Uppercase Letter | 31349 | |
| Connector Punctuation | 5405 | 7.2% |
| Other Punctuation | 5405 | 7.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 5405 | |
| I | 3243 | |
| A | 3243 | |
| T | 3243 | |
| R | 2162 | 6.9% |
| E | 2162 | 6.9% |
| C | 2162 | 6.9% |
| S | 2162 | 6.9% |
| N | 2162 | 6.9% |
| O | 1081 | 3.4% |
| Other values (4) | 4324 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10810 | |
| s | 5405 | |
| t | 5405 | |
| c | 5405 | |
| v | 5405 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5405 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63779 | |
| Common | 10810 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10810 | |
| s | 5405 | 8.5% |
| D | 5405 | 8.5% |
| t | 5405 | 8.5% |
| c | 5405 | 8.5% |
| v | 5405 | 8.5% |
| I | 3243 | 5.1% |
| A | 3243 | 5.1% |
| T | 3243 | 5.1% |
| R | 2162 | 3.4% |
| Other values (9) | 14053 |
Common
| Value | Count | Frequency (%) |
| _ | 5405 | |
| . | 5405 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 74589 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10810 | |
| s | 5405 | 7.2% |
| _ | 5405 | 7.2% |
| D | 5405 | 7.2% |
| t | 5405 | 7.2% |
| . | 5405 | 7.2% |
| c | 5405 | 7.2% |
| v | 5405 | 7.2% |
| I | 3243 | 4.3% |
| A | 3243 | 4.3% |
| Other values (11) | 19458 |
| Distinct | 1081 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 353.8 KiB |
| 11-03-2022 | 5 |
|---|---|
| 28-03-2018 | 5 |
| 19-11-2020 | 5 |
| 21-05-2021 | 5 |
| 15-02-2019 | 5 |
| Other values (1076) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 54050 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 16-05-2022 |
|---|---|
| 2nd row | 13-05-2022 |
| 3rd row | 12-05-2022 |
| 4th row | 11-05-2022 |
| 5th row | 10-05-2022 |
Common Values
| Value | Count | Frequency (%) |
| 11-03-2022 | 5 | 0.1% |
| 28-03-2018 | 5 | 0.1% |
| 19-11-2020 | 5 | 0.1% |
| 21-05-2021 | 5 | 0.1% |
| 15-02-2019 | 5 | 0.1% |
| 11-11-2019 | 5 | 0.1% |
| 04-10-2019 | 5 | 0.1% |
| 23-12-2020 | 5 | 0.1% |
| 24-03-2021 | 5 | 0.1% |
| 03-04-2020 | 5 | 0.1% |
| Other values (1071) | 5355 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 11-03-2022 | 5 | 0.1% |
| 03-03-2021 | 5 | 0.1% |
| 26-11-2020 | 5 | 0.1% |
| 31-03-2021 | 5 | 0.1% |
| 12-08-2020 | 5 | 0.1% |
| 23-10-2020 | 5 | 0.1% |
| 28-08-2018 | 5 | 0.1% |
| 08-01-2019 | 5 | 0.1% |
| 17-06-2020 | 5 | 0.1% |
| 04-06-2018 | 5 | 0.1% |
| Other values (1071) | 5355 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13365 | |
| 2 | 11995 | |
| - | 10810 | |
| 1 | 8280 | |
| 8 | 2185 | 4.0% |
| 9 | 2140 | 4.0% |
| 3 | 1310 | 2.4% |
| 4 | 1030 | 1.9% |
| 7 | 995 | 1.8% |
| 5 | 985 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 43240 | |
| Dash Punctuation | 10810 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13365 | |
| 2 | 11995 | |
| 1 | 8280 | |
| 8 | 2185 | 5.1% |
| 9 | 2140 | 4.9% |
| 3 | 1310 | 3.0% |
| 4 | 1030 | 2.4% |
| 7 | 995 | 2.3% |
| 5 | 985 | 2.3% |
| 6 | 955 | 2.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10810 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 54050 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13365 | |
| 2 | 11995 | |
| - | 10810 | |
| 1 | 8280 | |
| 8 | 2185 | 4.0% |
| 9 | 2140 | 4.0% |
| 3 | 1310 | 2.4% |
| 4 | 1030 | 1.9% |
| 7 | 995 | 1.8% |
| 5 | 985 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13365 | |
| 2 | 11995 | |
| - | 10810 | |
| 1 | 8280 | |
| 8 | 2185 | 4.0% |
| 9 | 2140 | 4.0% |
| 3 | 1310 | 2.4% |
| 4 | 1030 | 1.9% |
| 7 | 995 | 1.8% |
| 5 | 985 | 1.8% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 311.5 KiB |
| EQ |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 10810 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EQ |
|---|---|
| 2nd row | EQ |
| 3rd row | EQ |
| 4th row | EQ |
| 5th row | EQ |
Common Values
| Value | Count | Frequency (%) |
| EQ | 5405 |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| eq | 5405 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 5405 | |
| Q | 5405 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10810 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 5405 | |
| Q | 5405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10810 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 5405 | |
| Q | 5405 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10810 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 5405 | |
| Q | 5405 |
| Distinct | 4125 |
|---|---|
| Distinct (%) | 76.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1132.839315 |
| Minimum | 151.95 |
|---|---|
| Maximum | 4033.95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 151.95 |
|---|---|
| 5-th percentile | 237.08 |
| Q1 | 326 |
| median | 606.9 |
| Q3 | 1956.5 |
| 95-th percentile | 3210.04 |
| Maximum | 4033.95 |
| Range | 3882 |
| Interquartile range (IQR) | 1630.5 |
Descriptive statistics
| Standard deviation | 984.9557937 |
|---|---|
| Coefficient of variation (CV) | 0.8694576364 |
| Kurtosis | -0.03213369956 |
| Mean | 1132.839315 |
| Median Absolute Deviation (MAD) | 358.3 |
| Skewness | 1.023041386 |
| Sum | 6122996.5 |
| Variance | 970137.9154 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 285 | 11 | 0.2% |
| 268 | 9 | 0.2% |
| 254 | 9 | 0.2% |
| 2085 | 8 | 0.1% |
| 2060 | 8 | 0.1% |
| 275 | 8 | 0.1% |
| 290 | 8 | 0.1% |
| 283 | 7 | 0.1% |
| 1980 | 7 | 0.1% |
| 2174 | 7 | 0.1% |
| Other values (4115) | 5323 |
| Value | Count | Frequency (%) |
| 151.95 | 1 | |
| 152 | 1 | |
| 152.4 | 1 | |
| 153 | 1 | |
| 153.65 | 1 | |
| 156.1 | 1 | |
| 157.5 | 1 | |
| 159.45 | 1 | |
| 163.1 | 1 | |
| 164 | 1 |
| Value | Count | Frequency (%) |
| 4033.95 | 1 | |
| 4012 | 1 | |
| 3992.7 | 1 | |
| 3978 | 1 | |
| 3930 | 1 | |
| 3925 | 2 | |
| 3920 | 1 | |
| 3918 | 1 | |
| 3910 | 1 | |
| 3900 | 1 |
| Distinct | 4556 |
|---|---|
| Distinct (%) | 84.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1146.285708 |
| Minimum | 153.2 |
|---|---|
| Maximum | 4043 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 153.2 |
|---|---|
| 5-th percentile | 239.66 |
| Q1 | 331.35 |
| median | 614 |
| Q3 | 1978 |
| 95-th percentile | 3230.8 |
| Maximum | 4043 |
| Range | 3889.8 |
| Interquartile range (IQR) | 1646.65 |
Descriptive statistics
| Standard deviation | 994.7326818 |
|---|---|
| Coefficient of variation (CV) | 0.8677877383 |
| Kurtosis | -0.04414565517 |
| Mean | 1146.285708 |
| Median Absolute Deviation (MAD) | 362.3 |
| Skewness | 1.019054466 |
| Sum | 6195674.25 |
| Variance | 989493.1082 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 288 | 6 | 0.1% |
| 365 | 5 | 0.1% |
| 320 | 5 | 0.1% |
| 289 | 5 | 0.1% |
| 298 | 5 | 0.1% |
| 262 | 5 | 0.1% |
| 295 | 5 | 0.1% |
| 2165 | 5 | 0.1% |
| 293.8 | 5 | 0.1% |
| 525 | 4 | 0.1% |
| Other values (4546) | 5355 |
| Value | Count | Frequency (%) |
| 153.2 | 1 | |
| 155.25 | 1 | |
| 155.6 | 1 | |
| 156.15 | 1 | |
| 157.85 | 1 | |
| 160.8 | 1 | |
| 161.9 | 1 | |
| 162.4 | 1 | |
| 166.4 | 1 | |
| 168.25 | 1 |
| Value | Count | Frequency (%) |
| 4043 | 1 | |
| 4041.7 | 1 | |
| 4012 | 1 | |
| 3989.9 | 1 | |
| 3981.75 | 1 | |
| 3980 | 1 | |
| 3978 | 1 | |
| 3977 | 1 | |
| 3945 | 1 | |
| 3944.4 | 1 |
| Distinct | 4614 |
|---|---|
| Distinct (%) | 85.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1118.388936 |
| Minimum | 149.45 |
|---|---|
| Maximum | 3980 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 149.45 |
|---|---|
| 5-th percentile | 234.26 |
| Q1 | 322.1 |
| median | 598 |
| Q3 | 1930.4 |
| 95-th percentile | 3180.8 |
| Maximum | 3980 |
| Range | 3830.55 |
| Interquartile range (IQR) | 1608.3 |
Descriptive statistics
| Standard deviation | 974.7068185 |
|---|---|
| Coefficient of variation (CV) | 0.8715275938 |
| Kurtosis | -0.02060556082 |
| Mean | 1118.388936 |
| Median Absolute Deviation (MAD) | 352 |
| Skewness | 1.027759066 |
| Sum | 6044892.2 |
| Variance | 950053.382 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 280 | 7 | 0.1% |
| 311 | 6 | 0.1% |
| 255 | 6 | 0.1% |
| 1985 | 5 | 0.1% |
| 258.1 | 5 | 0.1% |
| 270 | 5 | 0.1% |
| 358 | 5 | 0.1% |
| 300 | 5 | 0.1% |
| 323 | 4 | 0.1% |
| 246.6 | 4 | 0.1% |
| Other values (4604) | 5353 |
| Value | Count | Frequency (%) |
| 149.45 | 1 | |
| 150.2 | 1 | |
| 150.8 | 1 | |
| 151.15 | 1 | |
| 151.5 | 1 | |
| 152.4 | 1 | |
| 155 | 1 | |
| 155.2 | 1 | |
| 156.7 | 1 | |
| 159.4 | 1 |
| Value | Count | Frequency (%) |
| 3980 | 1 | |
| 3962.3 | 1 | |
| 3910.5 | 1 | |
| 3900 | 1 | |
| 3892.1 | 1 | |
| 3866 | 1 | |
| 3861 | 1 | |
| 3860.05 | 1 | |
| 3857 | 1 | |
| 3856 | 1 |
| Distinct | 4843 |
|---|---|
| Distinct (%) | 89.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1131.404746 |
| Minimum | 150.85 |
|---|---|
| Maximum | 4019.15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 150.85 |
|---|---|
| 5-th percentile | 237.06 |
| Q1 | 326.15 |
| median | 605.6 |
| Q3 | 1953.7 |
| 95-th percentile | 3200.13 |
| Maximum | 4019.15 |
| Range | 3868.3 |
| Interquartile range (IQR) | 1627.55 |
Descriptive statistics
| Standard deviation | 984.1283756 |
|---|---|
| Coefficient of variation (CV) | 0.86982875 |
| Kurtosis | -0.02914544749 |
| Mean | 1131.404746 |
| Median Absolute Deviation (MAD) | 357.5 |
| Skewness | 1.024434972 |
| Sum | 6115242.65 |
| Variance | 968508.6596 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 286.4 | 5 | 0.1% |
| 285.05 | 5 | 0.1% |
| 276.2 | 4 | 0.1% |
| 326.7 | 4 | 0.1% |
| 248 | 4 | 0.1% |
| 282.85 | 4 | 0.1% |
| 285.3 | 4 | 0.1% |
| 287.7 | 4 | 0.1% |
| 273.3 | 3 | 0.1% |
| 261.65 | 3 | 0.1% |
| Other values (4833) | 5365 |
| Value | Count | Frequency (%) |
| 150.85 | 1 | |
| 151.4 | 1 | |
| 151.95 | 1 | |
| 152.8 | 1 | |
| 153.4 | 1 | |
| 155.3 | 1 | |
| 158.2 | 1 | |
| 158.6 | 1 | |
| 161.3 | 1 | |
| 162.35 | 1 |
| Value | Count | Frequency (%) |
| 4019.15 | 1 | |
| 3990.6 | 1 | |
| 3968.15 | 1 | |
| 3954.55 | 1 | |
| 3935.65 | 1 | |
| 3915.9 | 1 | |
| 3914.65 | 1 | |
| 3903.3 | 1 | |
| 3897.9 | 1 | |
| 3892.9 | 1 |
| Distinct | 4529 |
|---|---|
| Distinct (%) | 83.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1131.992877 |
| Minimum | 151.1 |
|---|---|
| Maximum | 4025 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 151.1 |
|---|---|
| 5-th percentile | 237.16 |
| Q1 | 326.05 |
| median | 605.5 |
| Q3 | 1955 |
| 95-th percentile | 3200.28 |
| Maximum | 4025 |
| Range | 3873.9 |
| Interquartile range (IQR) | 1628.95 |
Descriptive statistics
| Standard deviation | 984.5162875 |
|---|---|
| Coefficient of variation (CV) | 0.8697195076 |
| Kurtosis | -0.03094494906 |
| Mean | 1131.992877 |
| Median Absolute Deviation (MAD) | 357.4 |
| Skewness | 1.023829071 |
| Sum | 6118421.5 |
| Variance | 969272.3204 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 267 | 7 | 0.1% |
| 433 | 7 | 0.1% |
| 260 | 6 | 0.1% |
| 515 | 5 | 0.1% |
| 324 | 5 | 0.1% |
| 284.5 | 5 | 0.1% |
| 2250 | 5 | 0.1% |
| 2112 | 5 | 0.1% |
| 286.55 | 5 | 0.1% |
| 2000 | 5 | 0.1% |
| Other values (4519) | 5350 |
| Value | Count | Frequency (%) |
| 151.1 | 1 | |
| 151.6 | 1 | |
| 152 | 1 | |
| 153.05 | 1 | |
| 153.6 | 1 | |
| 155.45 | 1 | |
| 157.9 | 1 | |
| 158.6 | 1 | |
| 161.3 | 1 | |
| 161.8 | 1 |
| Value | Count | Frequency (%) |
| 4025 | 1 | |
| 3993.95 | 1 | |
| 3965.35 | 1 | |
| 3950 | 1 | |
| 3943 | 1 | |
| 3917 | 1 | |
| 3914.7 | 1 | |
| 3903.05 | 1 | |
| 3898 | 2 | |
| 3884 | 1 |
| Distinct | 4842 |
|---|---|
| Distinct (%) | 89.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1131.93235 |
| Minimum | 150.85 |
|---|---|
| Maximum | 4019.15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 150.85 |
|---|---|
| 5-th percentile | 237.06 |
| Q1 | 326.2 |
| median | 605.6 |
| Q3 | 1954.05 |
| 95-th percentile | 3200.23 |
| Maximum | 4019.15 |
| Range | 3868.3 |
| Interquartile range (IQR) | 1627.85 |
Descriptive statistics
| Standard deviation | 984.4674775 |
|---|---|
| Coefficient of variation (CV) | 0.8697228927 |
| Kurtosis | -0.03116874752 |
| Mean | 1131.93235 |
| Median Absolute Deviation (MAD) | 357.6 |
| Skewness | 1.023747259 |
| Sum | 6118094.35 |
| Variance | 969176.2143 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 285.05 | 5 | 0.1% |
| 286.4 | 5 | 0.1% |
| 285.3 | 4 | 0.1% |
| 287.7 | 4 | 0.1% |
| 282.85 | 4 | 0.1% |
| 276.2 | 4 | 0.1% |
| 326.7 | 4 | 0.1% |
| 248 | 4 | 0.1% |
| 261.9 | 3 | 0.1% |
| 265.75 | 3 | 0.1% |
| Other values (4832) | 5365 |
| Value | Count | Frequency (%) |
| 150.85 | 1 | |
| 151.4 | 1 | |
| 151.95 | 1 | |
| 152.8 | 1 | |
| 153.4 | 1 | |
| 155.3 | 1 | |
| 158.2 | 1 | |
| 158.6 | 1 | |
| 161.3 | 1 | |
| 162.35 | 1 |
| Value | Count | Frequency (%) |
| 4019.15 | 1 | |
| 3990.6 | 1 | |
| 3968.15 | 1 | |
| 3954.55 | 1 | |
| 3935.65 | 1 | |
| 3915.9 | 1 | |
| 3914.65 | 1 | |
| 3903.3 | 1 | |
| 3897.9 | 1 | |
| 3892.9 | 1 |
| Distinct | 5286 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1132.543406 |
| Minimum | 151.82 |
|---|---|
| Maximum | 4010.33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 151.82 |
|---|---|
| 5-th percentile | 237.192 |
| Q1 | 326.36 |
| median | 605.74 |
| Q3 | 1952.66 |
| 95-th percentile | 3206.504 |
| Maximum | 4010.33 |
| Range | 3858.51 |
| Interquartile range (IQR) | 1626.3 |
Descriptive statistics
| Standard deviation | 984.8408792 |
|---|---|
| Coefficient of variation (CV) | 0.8695833413 |
| Kurtosis | -0.03231240056 |
| Mean | 1132.543406 |
| Median Absolute Deviation (MAD) | 357.26 |
| Skewness | 1.023334018 |
| Sum | 6121397.11 |
| Variance | 969911.5574 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 279.14 | 3 | 0.1% |
| 424.32 | 3 | 0.1% |
| 270.13 | 3 | 0.1% |
| 250.07 | 3 | 0.1% |
| 260.67 | 2 | < 0.1% |
| 1115.07 | 2 | < 0.1% |
| 312.15 | 2 | < 0.1% |
| 2043.13 | 2 | < 0.1% |
| 342.86 | 2 | < 0.1% |
| 265.47 | 2 | < 0.1% |
| Other values (5276) | 5381 |
| Value | Count | Frequency (%) |
| 151.82 | 1 | |
| 151.9 | 1 | |
| 152.99 | 1 | |
| 154.02 | 1 | |
| 154.9 | 1 | |
| 155.74 | 1 | |
| 157.74 | 1 | |
| 159.32 | 1 | |
| 159.38 | 1 | |
| 163.7 | 1 |
| Value | Count | Frequency (%) |
| 4010.33 | 1 | |
| 4009.82 | 1 | |
| 3947.83 | 1 | |
| 3937.16 | 1 | |
| 3931.95 | 1 | |
| 3929.4 | 1 | |
| 3924.25 | 1 | |
| 3908.95 | 1 | |
| 3897.84 | 1 | |
| 3896.96 | 1 |
| Distinct | 345 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1366.202303 |
| Minimum | 276.15 |
|---|---|
| Maximum | 4043 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 276.15 |
|---|---|
| 5-th percentile | 334 |
| Q1 | 388.95 |
| median | 782.5 |
| Q3 | 2296.2 |
| 95-th percentile | 3674.8 |
| Maximum | 4043 |
| Range | 3766.85 |
| Interquartile range (IQR) | 1907.25 |
Descriptive statistics
| Standard deviation | 1135.910053 |
|---|---|
| Coefficient of variation (CV) | 0.8314362009 |
| Kurtosis | -0.4255731308 |
| Mean | 1366.202303 |
| Median Absolute Deviation (MAD) | 448.5 |
| Skewness | 0.9306345768 |
| Sum | 7384323.45 |
| Variance | 1290291.648 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 793 | 250 | 4.6% |
| 388.95 | 248 | 4.6% |
| 3674.8 | 246 | 4.6% |
| 373.8 | 246 | 4.6% |
| 2369.35 | 240 | 4.4% |
| 2296.2 | 207 | 3.8% |
| 351.3 | 203 | 3.8% |
| 1534.5 | 185 | 3.4% |
| 1664.9 | 149 | 2.8% |
| 739.85 | 144 | 2.7% |
| Other values (335) | 3287 |
| Value | Count | Frequency (%) |
| 276.15 | 11 | 0.2% |
| 281.6 | 7 | 0.1% |
| 285.6 | 5 | 0.1% |
| 286.8 | 1 | < 0.1% |
| 288 | 5 | 0.1% |
| 290.8 | 30 | |
| 298.45 | 1 | < 0.1% |
| 300.75 | 5 | 0.1% |
| 301.6 | 63 | |
| 308.55 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4043 | 80 | |
| 3989.9 | 68 | |
| 3981.75 | 16 | 0.3% |
| 3980 | 1 | < 0.1% |
| 3896.5 | 1 | < 0.1% |
| 3877.6 | 5 | 0.1% |
| 3859.15 | 2 | < 0.1% |
| 3816.7 | 1 | < 0.1% |
| 3804.1 | 1 | < 0.1% |
| 3740.35 | 1 | < 0.1% |
| Distinct | 271 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 778.7434598 |
| Minimum | 149.45 |
|---|---|
| Maximum | 3036 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 149.45 |
|---|---|
| 5-th percentile | 159.4 |
| Q1 | 245.05 |
| median | 412.6 |
| Q3 | 1143 |
| 95-th percentile | 2214.95 |
| Maximum | 3036 |
| Range | 2886.55 |
| Interquartile range (IQR) | 897.95 |
Descriptive statistics
| Standard deviation | 714.4705418 |
|---|---|
| Coefficient of variation (CV) | 0.9174658649 |
| Kurtosis | 0.3068618772 |
| Mean | 778.7434598 |
| Median Absolute Deviation (MAD) | 253.2 |
| Skewness | 1.155422683 |
| Sum | 4209108.4 |
| Variance | 510468.1552 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 875.65 | 254 | 4.7% |
| 250.85 | 252 | 4.7% |
| 159.4 | 251 | 4.6% |
| 149.45 | 250 | 4.6% |
| 1506.05 | 250 | 4.6% |
| 1711.15 | 248 | 4.6% |
| 232.35 | 247 | 4.6% |
| 779.1 | 185 | 3.4% |
| 253.5 | 175 | 3.2% |
| 1830 | 133 | 2.5% |
| Other values (261) | 3160 |
| Value | Count | Frequency (%) |
| 149.45 | 250 | |
| 150.2 | 5 | 0.1% |
| 151.15 | 2 | < 0.1% |
| 152.4 | 1 | < 0.1% |
| 155 | 1 | < 0.1% |
| 159.4 | 251 | |
| 160.85 | 4 | 0.1% |
| 163.35 | 5 | 0.1% |
| 165 | 6 | 0.1% |
| 166.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3036 | 6 | 0.1% |
| 3004 | 32 | |
| 2987.05 | 9 | 0.2% |
| 2901.8 | 4 | 0.1% |
| 2880 | 39 | |
| 2845 | 5 | 0.1% |
| 2785 | 5 | 0.1% |
| 2755 | 5 | 0.1% |
| 2706.15 | 5 | 0.1% |
| 2624.45 | 10 | 0.2% |
| Distinct | 5403 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13832333.99 |
| Minimum | 142541 |
|---|---|
| Maximum | 214955688 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 142541 |
|---|---|
| 5-th percentile | 1724797.6 |
| Q1 | 4080283 |
| median | 8009017 |
| Q3 | 16244724 |
| 95-th percentile | 47631032.6 |
| Maximum | 214955688 |
| Range | 214813147 |
| Interquartile range (IQR) | 12164441 |
Descriptive statistics
| Standard deviation | 17402816.85 |
|---|---|
| Coefficient of variation (CV) | 1.258125843 |
| Kurtosis | 20.05581883 |
| Mean | 13832333.99 |
| Median Absolute Deviation (MAD) | 4930624 |
| Skewness | 3.60827038 |
| Sum | 7.47637652 × 1010 |
| Variance | 3.028580344 × 1014 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 3755507 | 2 | < 0.1% |
| 17972207 | 2 | < 0.1% |
| 5572177 | 1 | < 0.1% |
| 7707953 | 1 | < 0.1% |
| 6311227 | 1 | < 0.1% |
| 12434745 | 1 | < 0.1% |
| 3165495 | 1 | < 0.1% |
| 20249909 | 1 | < 0.1% |
| 6431244 | 1 | < 0.1% |
| 8414514 | 1 | < 0.1% |
| Other values (5393) | 5393 |
| Value | Count | Frequency (%) |
| 142541 | 1 | |
| 144530 | 1 | |
| 224421 | 1 | |
| 298819 | 1 | |
| 315376 | 1 | |
| 328991 | 1 | |
| 413871 | 1 | |
| 456541 | 1 | |
| 564509 | 1 | |
| 576853 | 1 |
| Value | Count | Frequency (%) |
| 214955688 | 1 | |
| 201325176 | 1 | |
| 192810772 | 1 | |
| 157820882 | 1 | |
| 155703140 | 1 | |
| 151750421 | 1 | |
| 149245973 | 1 | |
| 145903102 | 1 | |
| 145203439 | 1 | |
| 143083196 | 1 |
| Distinct | 5405 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9375572682 |
| Minimum | 46436853 |
|---|---|
| Maximum | 1.47 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 46436853 |
|---|---|
| 5-th percentile | 918073969.8 |
| Q1 | 3955927032 |
| median | 6757642220 |
| Q3 | 1.159632556 × 1010 |
| 95-th percentile | 2.621089946 × 1010 |
| Maximum | 1.47 × 1011 |
| Range | 1.469535631 × 1011 |
| Interquartile range (IQR) | 7640398529 |
Descriptive statistics
| Standard deviation | 9629199823 |
|---|---|
| Coefficient of variation (CV) | 1.027051909 |
| Kurtosis | 30.29905043 |
| Mean | 9375572682 |
| Median Absolute Deviation (MAD) | 3428755458 |
| Skewness | 4.0523113 |
| Sum | 5.067497035 × 1013 |
| Variance | 9.272148924 × 1019 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.356160157 × 1010 | 1 | < 0.1% |
| 1.80245977 × 1010 | 1 | < 0.1% |
| 4729227902 | 1 | < 0.1% |
| 9144123989 | 1 | < 0.1% |
| 5431995775 | 1 | < 0.1% |
| 4461505135 | 1 | < 0.1% |
| 7530602219 | 1 | < 0.1% |
| 1.566004488 × 1010 | 1 | < 0.1% |
| 5302849983 | 1 | < 0.1% |
| 6845341476 | 1 | < 0.1% |
| Other values (5395) | 5395 |
| Value | Count | Frequency (%) |
| 46436853 | 1 | |
| 80082326 | 1 | |
| 192142151 | 1 | |
| 240131505 | 1 | |
| 247693013 | 1 | |
| 259105311 | 1 | |
| 279518393 | 1 | |
| 280714185 | 1 | |
| 287560484 | 1 | |
| 289781591 | 1 |
| Value | Count | Frequency (%) |
| 1.47 × 1011 | 1 | |
| 1.27 × 1011 | 1 | |
| 1.24 × 1011 | 1 | |
| 1.18 × 1011 | 1 | |
| 9.179980463 × 1010 | 1 | |
| 8.912049492 × 1010 | 1 | |
| 8.839332015 × 1010 | 1 | |
| 8.835029614 × 1010 | 1 | |
| 8.750594271 × 1010 | 1 | |
| 8.549082574 × 1010 | 1 |
| Distinct | 5366 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 174616.0834 |
| Minimum | 2593 |
|---|---|
| Maximum | 1428490 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 42.4 KiB |
Quantile statistics
| Minimum | 2593 |
|---|---|
| 5-th percentile | 43213.8 |
| Q1 | 96681 |
| median | 143925 |
| Q3 | 216493 |
| 95-th percentile | 412247.8 |
| Maximum | 1428490 |
| Range | 1425897 |
| Interquartile range (IQR) | 119812 |
Descriptive statistics
| Standard deviation | 125243.3489 |
|---|---|
| Coefficient of variation (CV) | 0.7172497883 |
| Kurtosis | 11.21579501 |
| Mean | 174616.0834 |
| Median Absolute Deviation (MAD) | 55773 |
| Skewness | 2.498468505 |
| Sum | 943799931 |
| Variance | 1.568589644 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 31057 | 2 | < 0.1% |
| 138967 | 2 | < 0.1% |
| 136133 | 2 | < 0.1% |
| 91094 | 2 | < 0.1% |
| 106325 | 2 | < 0.1% |
| 76911 | 2 | < 0.1% |
| 77874 | 2 | < 0.1% |
| 162870 | 2 | < 0.1% |
| 175976 | 2 | < 0.1% |
| 170796 | 2 | < 0.1% |
| Other values (5356) | 5385 |
| Value | Count | Frequency (%) |
| 2593 | 1 | |
| 6533 | 1 | |
| 7892 | 1 | |
| 9190 | 1 | |
| 9412 | 1 | |
| 9567 | 1 | |
| 11831 | 1 | |
| 12377 | 1 | |
| 12576 | 1 | |
| 13145 | 1 |
| Value | Count | Frequency (%) |
| 1428490 | 1 | |
| 1285533 | 1 | |
| 1233053 | 1 | |
| 1194059 | 1 | |
| 1155236 | 1 | |
| 1154959 | 1 | |
| 1121959 | 1 | |
| 1078097 | 1 | |
| 990935 | 1 | |
| 908228 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 331.6 KiB |
| RELIANCE | |
|---|---|
| TATASTEEL | |
| SBIN | |
| WIPRO | |
| TCS |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.8 |
| Min length | 3 |
Characters and Unicode
| Total characters | 31349 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RELIANCE |
|---|---|
| 2nd row | RELIANCE |
| 3rd row | RELIANCE |
| 4th row | RELIANCE |
| 5th row | RELIANCE |
Common Values
| Value | Count | Frequency (%) |
| RELIANCE | 1081 | |
| TATASTEEL | 1081 | |
| SBIN | 1081 | |
| WIPRO | 1081 | |
| TCS | 1081 |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| reliance | 1081 | |
| tatasteel | 1081 | |
| sbin | 1081 | |
| wipro | 1081 | |
| tcs | 1081 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4324 | |
| T | 4324 | |
| I | 3243 | |
| A | 3243 | |
| S | 3243 | |
| R | 2162 | |
| L | 2162 | |
| N | 2162 | |
| C | 2162 | |
| B | 1081 | 3.4% |
| Other values (3) | 3243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 31349 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4324 | |
| T | 4324 | |
| I | 3243 | |
| A | 3243 | |
| S | 3243 | |
| R | 2162 | |
| L | 2162 | |
| N | 2162 | |
| C | 2162 | |
| B | 1081 | 3.4% |
| Other values (3) | 3243 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31349 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4324 | |
| T | 4324 | |
| I | 3243 | |
| A | 3243 | |
| S | 3243 | |
| R | 2162 | |
| L | 2162 | |
| N | 2162 | |
| C | 2162 | |
| B | 1081 | 3.4% |
| Other values (3) | 3243 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31349 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 4324 | |
| T | 4324 | |
| I | 3243 | |
| A | 3243 | |
| S | 3243 | |
| R | 2162 | |
| L | 2162 | |
| N | 2162 | |
| C | 2162 | |
| B | 1081 | 3.4% |
| Other values (3) | 3243 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Source.Name | DATE | SERIES | OPEN | HIGH | LOW | PREV. CLOSE | LTP | CLOSE | VWAP | 52W H | 52W L | VOLUME | VALUE | NO OF TRADES | SYMBOL | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | RELIANCE_Data.csv | 16-05-2022 | EQ | 2434.45 | 2481.00 | 2416.65 | 2426.60 | 2428.05 | 2427.20 | 2444.28 | 2856.15 | 1930.4 | 6201594 | 1.515841e+10 | 244925 | RELIANCE |
| 1 | RELIANCE_Data.csv | 13-05-2022 | EQ | 2426.00 | 2478.00 | 2415.35 | 2399.40 | 2431.45 | 2426.60 | 2451.67 | 2856.15 | 1906.0 | 8910998 | 2.184680e+10 | 408746 | RELIANCE |
| 2 | RELIANCE_Data.csv | 12-05-2022 | EQ | 2427.50 | 2434.85 | 2370.00 | 2449.30 | 2403.50 | 2399.40 | 2400.60 | 2856.15 | 1906.0 | 9456280 | 2.270075e+10 | 359540 | RELIANCE |
| 3 | RELIANCE_Data.csv | 11-05-2022 | EQ | 2472.65 | 2484.95 | 2421.95 | 2474.65 | 2450.75 | 2449.30 | 2454.29 | 2856.15 | 1906.0 | 7681157 | 1.885176e+10 | 325039 | RELIANCE |
| 4 | RELIANCE_Data.csv | 10-05-2022 | EQ | 2495.00 | 2526.60 | 2458.00 | 2518.30 | 2461.70 | 2474.65 | 2495.14 | 2856.15 | 1906.0 | 9004636 | 2.246785e+10 | 329083 | RELIANCE |
| 5 | RELIANCE_Data.csv | 09-05-2022 | EQ | 2574.95 | 2597.10 | 2507.00 | 2620.65 | 2508.00 | 2518.30 | 2540.75 | 2856.15 | 1906.0 | 8345649 | 2.120422e+10 | 344258 | RELIANCE |
| 6 | RELIANCE_Data.csv | 06-05-2022 | EQ | 2612.20 | 2659.00 | 2593.55 | 2640.90 | 2628.00 | 2620.65 | 2619.88 | 2856.15 | 1906.0 | 9068448 | 2.375828e+10 | 291431 | RELIANCE |
| 7 | RELIANCE_Data.csv | 05-05-2022 | EQ | 2723.50 | 2730.00 | 2632.00 | 2693.65 | 2643.00 | 2640.90 | 2677.81 | 2856.15 | 1906.0 | 7942721 | 2.126910e+10 | 256514 | RELIANCE |
| 8 | RELIANCE_Data.csv | 04-05-2022 | EQ | 2785.00 | 2790.00 | 2676.30 | 2780.45 | 2692.00 | 2693.65 | 2728.03 | 2856.15 | 1906.0 | 8882792 | 2.423248e+10 | 277638 | RELIANCE |
| 9 | RELIANCE_Data.csv | 02-05-2022 | EQ | 2762.00 | 2805.50 | 2758.05 | 2790.25 | 2780.90 | 2780.45 | 2783.29 | 2856.15 | 1906.0 | 4369022 | 1.216027e+10 | 189251 | RELIANCE |
Last rows
| Source.Name | DATE | SERIES | OPEN | HIGH | LOW | PREV. CLOSE | LTP | CLOSE | VWAP | 52W H | 52W L | VOLUME | VALUE | NO OF TRADES | SYMBOL | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5395 | WIPRO_Data.csv | 12-01-2018 | EQ | 321.00 | 323.75 | 317.20 | 321.10 | 320.00 | 318.80 | 319.36 | 568.0 | 252.0 | 3162664 | 1.010016e+09 | 31980 | WIPRO |
| 5396 | WIPRO_Data.csv | 11-01-2018 | EQ | 323.55 | 326.05 | 319.00 | 326.70 | 320.05 | 321.10 | 322.79 | 568.0 | 252.0 | 2116462 | 6.831816e+08 | 24600 | WIPRO |
| 5397 | WIPRO_Data.csv | 10-01-2018 | EQ | 316.40 | 327.85 | 315.10 | 317.20 | 326.40 | 326.70 | 322.70 | 568.0 | 252.0 | 3811138 | 1.229866e+09 | 36772 | WIPRO |
| 5398 | WIPRO_Data.csv | 09-01-2018 | EQ | 312.60 | 320.00 | 306.40 | 311.15 | 315.30 | 317.20 | 313.19 | 568.0 | 252.0 | 4575572 | 1.433025e+09 | 46168 | WIPRO |
| 5399 | WIPRO_Data.csv | 08-01-2018 | EQ | 310.00 | 312.90 | 309.10 | 309.55 | 311.80 | 311.15 | 310.76 | 568.0 | 252.0 | 1785248 | 5.547758e+08 | 26106 | WIPRO |
| 5400 | WIPRO_Data.csv | 05-01-2018 | EQ | 313.00 | 313.90 | 307.70 | 311.65 | 310.05 | 309.55 | 311.03 | 568.0 | 252.0 | 1613205 | 5.017510e+08 | 23732 | WIPRO |
| 5401 | WIPRO_Data.csv | 04-01-2018 | EQ | 310.10 | 313.00 | 307.45 | 309.95 | 310.60 | 311.65 | 309.71 | 568.0 | 252.0 | 1464584 | 4.535995e+08 | 16466 | WIPRO |
| 5402 | WIPRO_Data.csv | 03-01-2018 | EQ | 320.40 | 320.40 | 308.75 | 318.70 | 309.85 | 309.95 | 315.06 | 568.0 | 252.0 | 2197677 | 6.923997e+08 | 42609 | WIPRO |
| 5403 | WIPRO_Data.csv | 02-01-2018 | EQ | 315.85 | 324.00 | 314.45 | 316.55 | 316.50 | 318.70 | 318.70 | 568.0 | 252.0 | 2874518 | 9.160983e+08 | 29055 | WIPRO |
| 5404 | WIPRO_Data.csv | 01-01-2018 | EQ | 311.50 | 320.00 | 309.45 | 314.25 | 314.80 | 316.55 | 316.44 | 568.0 | 252.0 | 3350397 | 1.060197e+09 | 27501 | WIPRO |